CR00274M - Thomas et al. 



1 



METHOD AND DEVICE FOR MULTIPLE INPUT/MULTIPLE OUTPUT 
TRANSMIT AND RECEIVE WEIGHTS FOR EQUAL-RATE DATA 
STREAMS 



FIELD OF THE INVENTION 

In general, the invention relates to the field of communication systems. 
10 More specifically, the invention relates to strategies for wireless 

Multiple-lnput/Multiple-Output (MIMO) communications and in particular, to 
establishing transmit and receive weighting matrices for use at the transmitter 
antenna array and the receiver antenna array. 

15 BACKGROUND OF THE INVENTION 

In a wireless communication system, a major design challenge is to 
maximize system capacity and performance. One such communication 
design known in the art is Multiple-Input/Multiple Output (MIMO), and is a 
means of transmitting multiple data streams on the same time-frequency 

20 channel to a single receiver. The MIMO strategy involves deploying multiple 
antennas on both the transmitter and the receiver. 

In environments having rich multipath scattering, large increases in 
capacity can be achieved through the use of MIMO and appropriate transmit 
and receive signal processing techniques. It is well known that MIMO 

25 wireless channels have significantly higher capacities than single-input single- 
output wireless communication channels, which has motivated the design of 
wireless communication systems with multiple antennas. Current system 
algorithms designed to achieve high capacity include spatial multiplexing, 
space-time coding and adaptive modulation. 

30 These MIMO systems include the various BLAST (Bell-labs LAyered 

Space-Time) type techniques proposed by Lucent as a subset. The BLAST 
type techniques however, do not use the channel knowledge at the transmitter 
and thus are sub-optimal. 
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For Ml MO systems, the maximum theoretical system capacity can be 
achieved using transmit and receive weights based on the Singular Value 
Decomposition (SVD) combined with a water-pouring (optimally assigning 
power to the individual data streams) strategy for determining the optimal data 
5 rates and power distribution. This strategy for maximizing capacity must 
adaptively control not only the number of independent data streams to be 
formed but also the choice of modulation and coding to be used on each 
stream. However, even for full-rank channels (as are known in the art), there 
are cases where a fixed number of data streams having the same modulation 

10 type is desirable, for example to avoid having to make a complex real-time 
decision on both the number of streams and the modulation/coding type on 
each stream. Consequently, using the SVD weights for a fixed number of 
equal-rate data streams is not the best option because fixing the modulation 
type will not take advantage of the unequal signal-to-noise ratios at the 

1 5 outputs of the receive array. 

Therefore, it would be desirable to have a method and device for 
finding transmit and receive weights that are optimized for the use of equal- 
rate data streams. Further, it would be desirable that these weights have a 
lower Bit Error Rate (BER) than weights found using the SVD approach. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is an overview diagram of a preferred embodiment of a wireless 
(cellular) communication system in accordance with the invention; 

FIG. 2 is a block diagram illustrating a preferred embodiment of a 
receiving device imbedded within the communication system of FIG. 1 in 
25 accordance with the invention; 

FIG. 3 is a block diagram illustrating details of an antenna combiner 
imbedded within the receiving device of FIG. 2 in accordance with the 
invention; 
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FIG. 3A is a block diagram illustrating a preferred embodiment of a 
transmitting device imbedded within the communication system of FIG. 1 in 
accordance with the invention; 

FIG. 4 is a block diagram illustrating one embodiment of a transmit 
5 weighting unit imbedded within the transmitting device of FIG. 3A in 
accordance with the invention; 

FIG. 5 is a flow chart representation of one embodiment of a method 
for providing Multiple Input Multiple Output (MIMO) receive and transmit 
weights as performed by the antenna combiner of FIG. 3 and transmitting 
10 device of FIG. 3A, for finding the MMSE linear weights, in accordance with the 
invention; 

FIG. 6 is a flow chart representation of an alternative embodiment of a 
method performed by the antenna combiner of FIG. 3 and transmitting device 
of FIG. 3A, for finding the MMSE successive cancellation weights, in 

15 accordance with the invention; 

FIG. 7 is a flow chart representation of an alternative embodiment of a 
method performed by the antenna combiner of FIG. 3 and transmitting device 
of FIG. 3A, for finding the MMSE successive cancellation weights, in 
accordance with the invention; 

20 FIG. 8 is a flow chart representation of an alternative embodiment of a 

method performed by the antenna combiner of FIG. 3 and transmitting device 
of FIG. 3A, for finding the MMSE successive cancellation weights, in 
accordance with the invention; and 

FIG. 8A is a flow chart representation of an embodiment for the method 

25 of FIG. 8 as performed by the antenna combiner of FIG. 3 and transmitting 
device of FIG. 3A. 
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DETAILED DESCRIPTION OF THE 
PRESENTLY PREFERRED EMBODIMENTS 

FIG. 1 illustrates a wireless communication system 100 in accordance 

with one embodiment of the present invention. As shown in FIG. 1, a base 

5 station 110 provides communication service to a geographic region known as 

a cell 103,102. At least one user device 120 and 130 communicate with the 

base station 110. In one embodiment of the wireless communication system 

100, at least zero external interference sources 140 share the same spectrum 

allocated to the base station 110 and subscriber devices 120 and 130. The 

10 external interference sources 140 represent an unwanted source of emissions 
that interferes with the communication process between the base station 110 
and the user devices 120 and 130. The exact nature and number of the 
external interference sources 140 will depend on the specific embodiment of 
the wireless communication system 100. In the embodiment shown in FIG. 1, 

15 an external interference source will be another user device 140 (similar in 
construction and purpose to user device 120) that is communicating with 
another base station 112 in the same frequency spectrum allocated to base 
station 110 and user devices 120 and 130. As shown in FIG. 1, user devices 
120 have a single transmit antenna 101, while user devices 130 have at least 

20 one antenna 101 . One embodiment of the invention provides that the user 
devices 120 and 130, as well as the base station 110 may transmit, receive, 
or both from the at least one antenna 101. An example of this would be a 
typical cellular telephone. Additionally, one embodiment of the invention can 
be implemented as part of a base station 1 10 as well as part of a user device 

25 120 or 130. Furthermore, one embodiment provides that user devices as well 
as base stations may be referred to as transmitting units, receiving units, 
transmitters, receivers, transceivers, or any like term known in the art, and 
alternative transmitters and receivers known in the art may be used. 

FIG. 2, is a block diagram illustrating one embodiment of a receiving 

30 device 200 imbedded within the wireless communication system 100 of FIG. 
1, in accordance with the present invention. The receiving device 200 
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includes at least one antenna 101 wherein the outputs of the antennas are 
each provided to a receiving unit 201. The outputs of the receiving units 201 
are provided to at least one antenna combiner 202. The signals from the 
receiving units 201 are also fed into a combiner controller 210, which may 
5 regulate the operation of the at least one antenna combiner 202. The signals 
from the receiving units 201 may also be fed into a channel estimation device 
208. A pilot symbol generator 212 generates pilot symbol information that is 
used by the combiner controller 210 to control the antenna combiner 202. 
The pilot symbol information generated by the pilot symbol generator 212 is 

10 also used by the channel estimation device 208 to estimate a time-varying 
frequency response of the transmitting devices of wireless communication 
system. The output of the antenna combiner 202 is fed into an information 
decoding unit 206, which decodes the antenna combiner output 204 and 
generates data information 213 that was received by the antennas 101. 

1 5 FIG. 3 is a block diagram illustrating details of the antenna combiner 

202 of receiving device 200 of FIG. 2. In one embodiment, antenna combiner 
202 can be coupled to the receiving units 201, which in turn are coupled to the 
antennas 101. The receiving units 201 may include radio frequency pre- 
amplifiers, filters, and other devices that can be used to convert a radio 

20 frequency signal received by the antenna 101 , to a digital stream of baseband 
equivalent complex symbols. As illustrated in FIG. 2, the output (y) of the / h 
receiving unit 201 (where / is an integer between 1 and M inclusive, and M is 
the total number of antenna 101 elements) may be mathematically denoted by 
y{k), where k and / are integers, and is provided to the antenna combiner 202. 

25 The antenna combiner 202 can be in the form of at least one complex 

multipliers 302 that multiply the output of each receiving unit 201 by a weight 
(w) 304 mathematically denoted as w{k). A combiner 306 may sum the 
outputs of the at least one complex multipliers 302. For one embodiment of 
the invention, the combiner controller 210 of FIG. 2 controls the values of the 

30 weights 304. 
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FIG. 3A is a block diagram illustrating one embodiment of a 
transmitting device 3A00 imbedded within the wireless communication system 
100 of FIG. 1, in accordance with the present invention. The transmitting 
device 3A00 may include at least one antenna 101 wherein the inputs to the 
5 antennas may be provided from at least one transmitting unit 3A01 , and at 
least one transmit combiner 3A03. The inputs to the transmit combiner 3A03 
may be provided from at least one transmit weighting unit 3A05. At least one 
Information Bit Stream 3A13 may be encoded by an Information Encoding 
Unit 3A07 and converted into a Data Stream 3A10. The Data Stream 3A10 is 

1 0 weighted by transmit weighting unit 3A05 and the outputs of the transmit 
weighting unit 3A05 may be sent to the transmit combiner 3A03. The 
Transmit Weighting Unit Controller 3A11 may be provided with channel state 
information 3A12, which may be used by the Transmit Weighting Unit 
Controller 3A10 to control the operation of transmit weighting units 3A05. 

1 5 FIG. 4 is a block diagram illustrating one embodiment of the 

transmitting weighting unit 3A05 of FIG. 3A, in accordance with the present 
invention. The symbol (x) on data stream / at time k, x,</c) 410 is weighted 403 
by a transmit weight 404. The transmit weight 404 value may be dependant 
on which antenna the transmit weight 404 is associated with. In equation 

20 form, an antenna m's signal (v) for data stream /' at time k 420 may be 

expressed as v/, m (/c)=\// jm (/c)x,</c). The antenna m's signal at time k for data 
stream /' may be sent to the transmit combiner 3A03 for antenna m in order to 
combine all the data streams before radiating the signal out of antenna m. 

FIG. 5 is a flow chart representation of a method for providing multiple 

25 input multiple output MIMO receive weights as can be performed by the 

antenna combiner of FIG. 3, and the transmitting device 3A00, in accordance 
with the invention. 

One embodiment of the invention may transmit and receive N s data 
streams, wherein each data stream can have the same modulation type with 

30 Mr transmit antennas and M R receive antennas. The present invention may 
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provide an alternative to the SVD weights that are known in the art. One 
embodiment of the present invention can minimize the mean square error 
between the receive array outputs and the transmitted symbols. Using the 
SVD weights for a fixed number of equal-rate data streams may not be the 
5 best option because fixing the modulation type will not take advantage of the 
unequal signal-to-noise ratios at the outputs of the receive array. This is 
because the SVD weights diagonalize the channel, which means that there is 
no cross talk between the data streams. Thus MMSE weights based on the 
SVD approach will not take advantage of the performance gains possible by 

1 0 trading off the suppression of other streams with gains over noise. Further, if 
the SVD weights are used while changing each streams' transmitted power so 
that each data stream is received with equal energy, the optimal data stream 
will be penalized while an inadequate data stream is built-up, again indicating 
that this is not the best solution. One embodiment of the present invention 

15 computes linear MMSE weights that are not constrained to diagonalize the 
channel, thereby providing superior performance when all data streams have 
the same modulation type. 

In order to compute the linear MMSE weights in accordance with one 
embodiment of the present invention, a single estimate of the channel gain 

20 and phase between each receive antenna and each transmit antenna may be 
provided to the invention (this is also called the channel). The channel is 
modeled as stationary and flat-faded. As is known in the art, such channels 
occur in systems that include, but are not limited to, single carrier systems 
operating in frequency non-selective channels, OFDM systems in which an 

25 embodiment of the invention operates on a set of frequency-domain 

subcarriers having identical frequency response characteristics, or single 
carrier system in which an embodiment of the invention operates in the 
frequency domain on a set of frequency-bins having identical frequency 
response characteristics, or other similar systems as is known in the art. Also 
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provided is that both the transmitting weighting unit and receiving device may 
know the channel. 

In one embodiment, the MrxMj channel matrix, H, contains the gain 
and phase between each transmit and receive element. Also let the M T xN s 
5 matrix (also known as the transmit weighting matrix), V, denote the transmit 
weights for the N s streams (the u th column of V is denoted v 0 ), and finally let 
the A/ S x1 vector, x, be the symbols on the N s streams (at one particular time). 
Then for this embodiment, a received M«x1 vector can be given as: 

y = HVx + n (1) 
1 0 where n is a Mrx 1 vector of noise with power of al on each of the received 
antennas. In one embodiment, in order to keep an average transmit power 
equal to one, the transmit weighting matrix may be normalized as follows: 

trace(V"V) = 1 (2) 
where trace(A) means the sum of the diagonal elements of the square matrix 
15 A. 

One embodiment of the invention finds the transmit weighting matrix 
and a M R xN s receive weighting matrix, W, such that an estimate of the 
transmitted symbols, r, is: 

r = W H y (3) 
20 (The u th column of W is denoted w u ) In one embodiment of the invention, the 
operation of the invention may be optimized for the case where each of the N s 
data streams is constrained to have the same modulation type. This 
embodiment of the invention minimizes the Mean Squared Error (MSE) 
between the estimated symbol on each stream and the transmitted symbol 
25 assuming linear receive and transmit weights. In equation form, this can be 
expressed as: 
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min 



5>.-x.r 



(4) 



where r u is the i/ th element of the vector r and x u is i/ h element of the vector x. 
Using E{x u x]} = 8{u -£), (4) becomes: 



mm 
w„,v„ 



|w^|hv/H h +( 7„ 2 I 



(5) 



In addition to the minimization of the objective function in (5), the 
constraints on the transmit weight vectors (the columns of the matrix V) given 
in (2) need to be met. One embodiment of the invention meets the constraints 
by modifying the objective function of (5) as follows: 



mm 
w„,v ( . 



-S w ? Hv " -IX H " w " +/v s 

I N I 2 



(6) 



10 where y is an arbitrary scaling placed on the constraint part of the objective 
function. 

A closed form solution to (6) does not exist for A/ s >1 . However, (6) is in 
the form of an unconstrained optimization problem so a modified version of an 
iterative algorithm such as the gradient-based optimization approach is used 
15 by one embodiment of the invention to solve for w u and v^. The gradient of 
(6) with respect to w 0 can be shown to be: 

Vw„ = (hVV h H" + ct 2 I Mr )w„ - Hv u (7) 
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Because (hVV h H" + <t 2 \ m ) is well conditioned, its inverse can be found. 
Therefore W can be found at each iteration step as follows (i.e., by setting 
equation (7) equal to zero): 

W = (hW h H" + <t 2 „\ Mr )" 1 HV (8) 
5 The gradient of (6) with respect to v 0 can be shown to be: 

Vv u =(H H WW H H + 2 r (trace(V H V)-1)l Wr )v u -H H w u (9) 
The invention lets G = [Vv 1 ... Vv Ns ] , and then using equation (9), G can 
be expressed as: 

G = (h h WW h H + 2Ktrace(V H V) - 1)l Mr )v - H H W (1 0) 

1 0 The above embodiment of the invention explicitly adds the constraint 

on the transmit weights in equation (2) to the minimization. Another option for 
ensuring the constraint is to use a Lagrangian multiplier, X, as is known in the 
art. For this method, the gradient matrix in (10) is changed as follows: 
G = (H H WW H H + AI Mj )V-H H W (11) 
15 One addition gradient is needed, the Lagrangian gradient which is: 

g A = trace(V"V-1) (12) 
The previous equations and calculations express a mathematical 
means for a method of providing linear MIMO transmit and receive weights. 
The method of providing MIMO receive and transmit weights, shown in 
20 the flow chart 500, may begin by being provided a channel matrix 510. The 
channel matrix H, is a M R xM T matrix that contains the gain and phase 
between each transmit antenna (M T is the total number of transmit antennas) 
and each receive antenna (M R is the total number or receive antennas). In 
one embodiment of the invention, the received vector y, is modeled as 
25 y(/c) = HVx(/f) + n(/c) , where V is a M T xN s matrix of the transmit weight 

vectors, x(/c) is a N s x^ vector of the transmitted symbols at index k (e.g., time 
or frequency), and n(k) is a M R x1 vector of noise at index k. The embodiment 
of the invention illustrated in FIG. 5 may require the total transmit power to be 
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limited, therefore the transmit weight vectors are constrained to 
trace(V H V) = 1 , where trace(A) means to sum the diagonal elements of the 
matrix A, and superscript H means the conjugate transpose (also known as 
the Hermitian) of the matrix. 
5 Another embodiment of the invention may find a receive weighting 

matrix (also referred to as the receive weight vectors) W, that find an 
estimate, r(k), of x(k) as a function of r(k) = W H y(/c) . Next for the 
embodiment 500, block 520 initializes an iteration number t=0. In one 
embodiment of the invention, transmit weight vectors can be initialized 530 as 

10 V 0 = —X=\ ' Ws , where \ Ns is a N s xN s matrix of all zeros except for the 
A /N s LVt-a'.)*". J 

diagonal elements which are one, and 0 ax b is an axb matrix of all zeros. 
Utilizing the iteration number t, the receive weight vectors W f , are found 540 
as W, = (HV f V f H H H + ct^I Mr ) HV, , where V f are the transmit weight vectors at 
iteration number t, al is the noise power, and \ M r is a M R xM R matrix of all 

1 5 zeros except for the diagonal elements which are one. Again utilizing the 
iteration number t, the gradient matrix Gf, is found 550 as 
G f = (H H W f W f H H + 2 7 (trace(V t w V f ) - 1)l Mr )v f — H H W f , where y is the 
constraint weight (in one embodiment, y=1), and \mt is a M T xM T matrix of all 
zeros, except for the diagonal elements that are one. In another embodiment 

20 of the invention, the gradient matrix is: G, = (h h W,W,"H + Z t \ Mr )V f - H H W t , 
the Lagrangian multiplier at iteration number t can be given as Xf : h^ + ot.gx,t-i 
and the Lagrangian gradient at iteration number t is: g Kt = trace(V f "V ( -1). 

If the embodiment of the invention utilizes an adaptive step size 560, 
the step size a, at iteration time f may be found 565 as the argument that 

25 minimizes the following objective function, C(V ( ,W,,G*,y,a) where 
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C(V„W„G f ,r,a) = W f H H(V f -aG f )(V ( -aG,) H H H W, - W f "H(V, -aG t ) 

-(V, -«G f )"H"W, +?*race((V f -aG,) H (V f -aG,)) 

After a step size has been established, the transmit weight vectors can 
be computed 570 at iteration f+1as a function of the transmit weight vectors at 
5 iteration number t, the step size at iteration number f, and the gradient matrix 
at iteration number f, using the equation V f+1 = V f -aG t . In another 
embodiment of the invention, the Lagrangian multiplier may be updated with: 
X M = l t +ag x t . After computing the transmit weight vectors 570, the iteration 
number is incremented by one 575 as f=f+1. If the iteration number has 

1 0 reached an 'end' value 580, where t end is an integer designating the maximum 
number of iterations, then the receive weight vectors are the receive weight 
vectors at iteration number t, (W=W ( ) 590 and the transmit weight vectors are 
the transmit weight vectors at iteration number f, (V=V f ) and this embodiment 
is completed. If the iteration number has not reached an 'end' value, 585 

1 5 decides if traceCG^G,,., ) < e , and if yes, the receive weight vectors are 

chosen to be the receive weight vectors at iteration number t, (W=W f ) and the 
transmit weight vectors are chosen to be the transmit weight vectors at 
iteration number t, (V=V f ) 590 again completing this embodiment. For 
decision block 585, trac^G^G^) < s means the sum of the diagonal 

20 elements of the square matrix (G^G^and where s is a number indicating 
how small the gradient matrix at iteration number r should get. In other words, 
when trace^^Gj^) < s , the algorithm has almost converged because at the 
optimal solution Gf.f=0. If in decision block 585, trac^G^G^) < s , is false, 
the process recedes to block 540 and the successive cancellation weights 

25 method continues until all values are satisfied by the techniques described 
within flow chart 500. 
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At this point the transmit vectors are, if necessary, renormalized to 
satisfy the constraint in Equation (2). An alternate embodiment of the 
invention is a closed form solution to the linear transmit and receive weights 
that has an equal Mean Squared Error (MSE) on each or the received data 
5 streams. At the receiver, an estimate, of the transmitted symbols, r, can be 
found by using a linear M R xN s weight matrix, W, as follows: 

r = W H y (13) 
An MMSE approach can be used to find W and V as summarized next 
where a closed form expression for W and V are given. Using the SVD, H 
1 0 can be expressed as follows: 

H = U H S H Z H H (14) 
where M R xM R U w and M T xM T Z H are unitary matrices and M R xM T S H is a 
matrix of all zeros except for the upper left r H xr H portion which is: 

[S H ] Ns =diag(s„ s H>f J (15) 
1 5 where r H is the rank of H (rH<min(M Tl M R )), [A] f means the first £ rows and 
columns of A, and it is assumed that s H1 > s H2 > ... > s HfH . The transmit 
weight matrix can also be expressed by a SVD as follows: 

V = U V S„Z{* (16) 
where M T xM T Uy is unitary, N s xN s Zy is unitary, and M T xN s Sv may be a 
20 matrix of all zeros except for the upper N s xN s portion which contains the N s 
non-zero singular values of the transmit weight matrix as follows: 

[S v ] Ns =diag(s^,...,s^ s ) (17) 
One way to find the components of (16) may be to set \Jv=Z H and the 
singular values can be found as the solution to: 

25 s 2 =^=<r n s H \ -cr 2 n s~ 2 e (18) 

v,t ^ H,e 

where X is chosen so that the following equation is satisfied (this forces the 
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transmit weights to have unit power): 

IX. =1 (19) 

The MSE averaged across the data streams may be unchanged 
regardless of the choice of the right singular vectors of the transmit weight 
5 matrix. One option is to use the identity matrix (i.e., Zv=l), however in general 
this choice gives unequal MSE on each data stream as will be seen. This 
means that the BER performance can be dominated by the data stream with 
the highest MSE. Thus if some unitary Z v can be found that gives equal MSE 
on each stream, the BER performance will improve while the average MSE 
10 will remain unchanged. 

Using \J V =Z H , the received data vector from (1) can be expressed as: 
y = U„S„ZEZ„S v Z?x + n 
= U H S H S v Z?x + n 
Let /V s x1 Y Ws = {U"y} Ws where {a}, means the first / elements of the 
vector a, then Y Ns is: 

Y Ns = {UjJU H S H S v Zl'x + U» A/> 
15 5 = {S H S v Z?x} Ns +{U H H n} Ns S (21) 

= DZ?x + N w 

where D is a real N s xN s diagonal matrix equal to diag(s w ,iS\/,i,...,s WjWs S\/ ; /vs) 
and N/vs has a covariance matrix equal to <J 2 n \ Ns . (A unitary matrix times a 
vector of uncorrelated Gaussian random variables does not change its 
covariance matrix). 
20 The MSE on stream I can be shown to be: 

MSE e = 1 - Zy ( DZ v/ (22) 
where Z v , e is the £ th column of Z v , and 

D = D 2 (D 2 +<7„ 2 ir 1 (23) 
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Thus unless all of the diagonal elements of d are equal, using Zv=\ 
gives unequal MSEs on each data stream. The MSE averaged across all 
streams can be shown to be: 

MSE = 1 - trace(D) / N s (24) 
5 Therefore in order to make all data streams have the same MSE, Z v>t 

must satisfy: 

Z H VI DZ VI = 1 - MSE = trace(D) /N s 
subject to Z V Z\} =ZyZ y =\ Ns 
One type of matrix that solves (25) is the DFT matrix (another choice 
for when N s is a power of four is the Hadamard matrix where the Hadamard 
1 0 matrix is known in the art). Thus the linear transmit weights that minimize the 
average MSE on the data streams and also have the same MSE on each 
stream are: 

V = V H S v F Ns (26) 
where Sy is given by (17) and (18) and F Ns is the /V s xA/ s DFT matrix 
1 5 normalized such that F^F Ws = I . With the transmit weights chosen according 
to (26), the N s streams can be recovered without any inter-stream interference 
as: 

r = F£D"X s (27) 
Another embodiment of the invention provides an alternative approach 

20 to calculating transmit and receive weights for MIMO equal rate data 

streamsand is illustrated as FIG. 6. The embodiment illustrated as flow chart 
600, finds the successive cancellation weights that minimize the MSE 
between the estimated symbols and the true symbols on each stream. The 
advantage of successive cancellation weights over linear weights is that some 

25 of the receive array's degrees are freedom are recovered by sequentially 
subtracting out the estimated contribution from the decoded streams. For 
example if there are three streams and three receive antennas, when 
decoding the first stream, two degrees of freedom in the array are needed to 
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null the other two streams. For the second stream, however, only one degree 
of freedom is needed to null the third stream since the decoded first stream is 
subtracted out. The extra degree of freedom then can be used in a max-ratio 
sense to provide an additional gain against noise. Finally, the third stream 
has all of the degrees of freedom in the array available to provide a gain 
against noise. 

In one embodiment, for the successive cancellation method, the 
estimated symbols are given as: 

r u =w"^y-JjHv^j, where x t =slice(r e ) (28) 

where in one embodiment slice(r e ) means to choose the closest signal 
constellation point to r t . In another embodiment, sliced) refers to any 
technique that involves decoding a plurality of received symbols r t into a bit 
stream, and then mapping the resulting bit stream to a signal constellation 
point. 

The MMSE successive cancellation weights can be found as the solution to 
the following minimization problem: 
min 



(29) 



Note that (29) differs from (4) due to the inner sum being performed from u to 
N s instead of 1 to N s as is done in (4). Using E{x u {n)x;(n)} = 8{u -I) , (29) 
becomes: 



mm 



-f]w^Hv u -f>?H"w u +N S 



(30) 



In addition to the minimization of the objective function in (30), the constraints 
on the transmit weight vectors given in (2) need to be met. One embodiment 
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of the invention meets those constraints by modifying the objective function of 
(30) as follows: 



15 



2 wi ^vX+^Vk 



(31) 



W„,VJ ^ u u A. u us 

-r|Sv> 0 -1| 

where y is an arbitrary scaling placed on the constraint part of the objective 
function. 

A closed form solution to (31) does not exist for N s >1 , therefore a 
modified version of an iterative algorithm, such as a gradient-based 
optimization approach, is used to solve for w 0 and v„. The gradient of (31) 
with respect to w u is: 

V Wu = (j]Hv,<H" jw„ -Hv u (32) 

Equation (32) cannot be put into the same form as (8) because of the sum 
from u to N s in the parenthesis. Setting equation (32) equal to zero, w 0 can be 
found at each iteration as: 

w, =^Hv,v^H H +crX R j Hv„ (33) 

The gradient of (31) with respect to is: 

Vv « =^H H w i w^H + 2 r (trace(V H V)-1)l Mr jv u -H H w„ (34) 

In another embodiment, the gradient is computed using Lagrangian multipliers 
as follows: 



Vv u = |£h h w / w?H + >*V Jv u -H H w„ (35) 
20 One addition gradient is needed, the Lagrangian gradient which is: 
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g A =trace(V H V-1) (36) 
The Lagrangian gradient is needed in order to update the Lagrangian 
multiplier, X. 

The method for calculating the successive cancellation weights is 
5 described in FIG. 6 as a flow chart representation 600 of a preferred 

embodiment of an alternative method performed by the antenna combiner of 
FIG. 3 and the transmit device of FIG. 3A, for finding the MMSE successive 
cancellation weights, in accordance with the invention. FIG. 6 begins by 
being provided 610 a channel matrix H that is a M R xM T matrix which contains 

1 0 the gain and phase between each transmit antenna (M T is the total number of 
transmit antennas) and each receive antenna {M R is the total number or 
receive antennas). The received vector, y, is modeled as 
y(k) = HVx(fc) + n(/c) , where for one embodiment of the invention, V is a 
M T xN s matrix of the transmit weight vectors, x(/c) is a A/ S x1 vector of the 

15 transmitted symbols at index (e.g., time or frequency), k, and n(/c) is a M R x1 
vector of noise at index k. Because the total transmit power has to be limited, 
in one embodiment of the invention, the transmit weight vectors may be 
constrained as trace(V H V) = 1 , where trace (A) means to sum the diagonal 
elements of the matrix A, and superscript H means the conjugate transpose 

20 (also known as the Hermitian) of the matrix. 

An additional embodiment of the invention finds receive weight vectors 
which are the columns of the receive weight matrix, W, that can be used to 
find an estimate, r(k), of x(/c) as 

r u (k) = w^y(k)- ! ^Hv ( x e (k)j, where x e (k) = slice(r e (k)) and r u (k) is the u th 

25 element of r(k) slice(r e (k)) may mean to choose the closest signal constellation 
point to r ( (k). 

In another embodiment of the invention, the method of FIG. 6 can 
initialize an iteration number t, to zero 620, and the transmit weight vectors 
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630 may be initialized as V 0 = -=• 




where \ Ns is a N s xN s matrix 



of all zeros except for the diagonal elements which are one, and 0 axb is an axb 
matrix of all zeros. With the initialization complete, the method of FIG. 6 
computes the receive weight vectors W ( at iteration number f 640, are found, 



5 for w=1 to N s , as w, u = ]THv (f v",H H + <x„ 2 I Mr Hv f,u where w tu is the u xl 



column of \N t , v tjU are the transmit weight vectors at iteration number f, <7 2 n is 
the noise power, and \ MR is a M R xM R matrix of all zeros except for the 
diagonal elements which are one. 

Flow chart 600 proceeds to compute the gradient vectors at iteration 
10 number f 650. The gradient vectors, g t ,u, are found using the equation 

g tu = ^H H w f/ w^H + 27(trace(V f H V f )-1)l Mr jv f0 -H H w f u , where y can be 

the constraint weight (in one embodiment of the invention, y=1), and Imt is a 
M T xM T matrix of all zeros, except for the diagonal elements which are one. In 
another embodiment of the invention, the gradient vectors may be found 

15 accordingto: g tu = ^H H w f/ w^H + ^l Mr jv,„ -H H w tu . It is by computing 

the gradient vectors, g t , u , instead of the gradient matrix, G f , that distinguishes 
the successive cancellation weights method of FIG. 6 from the linear MMSE 
weights method of FIG. 5. 



20 iteration time f may be found 665 as the argument that minimizes the following 




If a step size has not been calculated prior to 660, the step size a, at 
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objective function, C(V f ,W t ,G f ,y,a) where 

C(V,,W ( ,G ( ,r,a) = j-£w^H(v fiJ -«9f,u)-S( v f.« -«9f, u ) HHHw f," +N s 

After a step size has been established, the transmit weight vectors are 
computed 670 at iteration f+1 as a function of the transmit weight vectors at 
iteration number t, the step size at iteration number t, and the gradient vectors 
at iteration number t, using the equation v m „ = v f w -ag tu , where t/=1 to N s . 
In another embodiment of the invention, the Lagrangian multiplier is updated 
with: Ji M =l t +ag xt . 

After computing the transmit weight vectors 670, the iteration number 
may be incremented by one 675 as t=t+1 . If the iteration number has reached 
an 'end' value 680, where t en d is an integer designating the maximum number 
of iterations, then the receive weight vectors may be chosen to be the receive 
weight vectors at iteration number t, (W=W f ) and the transmit weight vectors 
may be chosen to be the transmit weight vectors at iteration number t, (V=V f ) 
690 completing this embodiment. If the iteration number has not reached an 
'end' value, 685 decides if trace(G" G M ) <e , and if yes, the receive weight 
vectors are the receive weight vectors at iteration number r, (W=W f ) and the 
transmit weight vectors are the transmit weight vectors at iteration number f, 
(V=V f ) 690 again completing this embodiment. For decision block 685, 
trace(G" G,^) < e means the sum of the diagonal elements of the square 
matrix (G" G f _,)and where s is a number indicating how small the gradient 
matrix at iteration number t should get. In other words, 
whentrace(G^ 1 G,_ ] ) < s , the algorithm has almost converged because at the 
optimal solution, G t .i=0. If in decision block 685, traceCG^G,^) < s , is false, 
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the process recedes to block 640 and the successive cancellation weights 
method continues until all values are satisfied by the techniques described 
within flow chart 600. At this point the transmit vectors may be renormalized, 
if necessary, to satisfy the constraint in Equation (2). 
5 FIG 7 shows a flowchart of another embodiment of the invention 700. 

In this embodiment the gradient matrix or vectors may never be calculated, 
but the transmit and receive linear and successive cancellation weights can 
be found through means of iteration. Initially, 710 one embodiment of the 
invention may be provided a channel matrix. The iteration number, 1 720, may 
1 0 next be set to 0 and the transmit and receive vectors can be initialized 730. In 
one embodiment of the invention, the transmit and receive vectors can be 
initialized to: 

v -^L[ lw < l 

For the linear weights, the receive weights are initialized to: 
15 W 0 = (HV 0 V 0 "H h + ctX r ) 1 HV 0 

For the successive cancellation weights, the receive weights are initialized to: 

w o, u =[SHVo,v^H H + c7 n 2 l MR ] Hv 0iU 

Additionally the objective function at iteration 0, Co, is set to an arbitrary large 

number. (In one embodiment C 0 =infinity.) 
20 The transmit weight vectors at iteration number f+1 may next be computed 

740 as a function of the transmit weight vectors at time t, the receive weight 

vectors at iteration number t, and the channel matrix as follows (where y is the 

constraint weight): 

For the linear weights: 
25 V f+1 = (H H W,W f H H + 2/(trace(V, H V f ) - 1)l Mr )" 1 H H W f 

For the successive cancellation weights: 
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v t+xu =^H H w„w^H + 2Htrace(V f H V,)-1)l Mr J H H w fu 

Next, the receive weight vectors at iteration f+1 may be computed as a 
function of the transmit weight vectors 750 at iteration number f+1 , the noise 
power, and the channel matrix as follows: 
For the linear weights: 

W f =(HV f V f "H" +<rX) -1 HV f 
For the successive cancellation weights: 

w tu =[XHv t /<H H +<T n 2 l M( 



The objective function at iteration number f+1 may next be computed 760 as a 
function of the transmit and receive weight vectors at iteration number f+1 , the 
noise power, and the channel matrix as follows: 
For the linear weights: 

C M =N S +W£ 1 H(V f+1 )(V f+1 ) H H"W f+1 -VO(V f+1 ) 

- (V f+1 ) H H" W f+1 + ?trace((V f+1 )" (V f+1 )) 

For the successive cancellation weights: 



Z < 1, \JL H(v MM )(v M , u ) H H H + aX R y M , u 
- J w^H^^ ) - f> f+1 , 0 ) H H H w f+1i(/ + N s 

U=1 0=1 

I N I 2 

+ rK(v f+1 ,J H (v f+1 , u )-i 



Next, 770 if the objective function at iteration f is less than the objective 
function at iteration f+1, then the receive weight vectors may be the receive 
weight vectors at iteration number f, (W=W f ) and the transmit weight vectors 
may be the transmit weight vectors at iteration number t, (V=V f ) 790 
completing this embodiment. If the objective function at iteration t is not less 
than the objective function at iteration f+1, then in 775 the iteration number, f, 
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can be incremented by one. If the iteration number has reached an 'end' 
value 780, where t en( j may be an integer designating the maximum number of 
iterations, then the receive weight vectors may be chosen to be the receive 
weight vectors at iteration number f, (W=W f ) and the transmit weight vectors 
may be chosen to be the transmit weight vectors at iteration number f, (V=V f ) 
790 completing this embodiment. If the end value is not reached, then the 
iterative procedure returns back to 740 and performs another iteration. After 
the iterations are completed, the transmit vectors may need to be 
renormalized to meet the constraint in Equation (2). 

A further embodiment of the invention may provide for the successive 
cancellation weights to find weights that maximize the theoretical capacity as 
described next. The capacity of the channel for a given transmit weight 
matrix, V, can be shown to be: 



C = log 2 de|l + ^HVV H H H 
Using (14) and (16), the capacity equation in (37) becomes: 
C = log 2 ^det|l + U H S H ZXSv z " z </ s £ u " z * s M 
Using Z"Z V = \ and making U V =Z H , (38) becomes: 



C = log 2 



det 



r u„s H s„sz;si;u" 



(37) 



(38) 



(39) 



In other words, the capacity is independent of the right singular vectors 
of the transmit weight matrix. However, even though the capacity is not 
affected by Z v , different Zy's will greatly affect the performance of a practical 
receiver depending on what type of receive algorithm is employed (e.g., 
successive cancellation or linear weights). Therefore a search can be 
performed only over Z v and the resulting weights will not change the 
theoretical capacity but will improve the receiver performance given the 
algorithm employed. 
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To maximize the capacity, the r(r*N s in general) singular values for the 
transmit weight vectors are selected according to the water-pouring strategy 
as known in the art with a total transmit power of one (this enforces the 
constraint trace(V"V) = 1 ). If the water-pouring strategy says to transmit on 
5 more streams than N s , then the singular values of the transmit weight vectors 
are selected according to the water-pouring strategy using only the largest N s 
singular values of H. 

To summarize, the transmit weights are expressed as: 

V=Z H S V Z% (40) 
1 0 Therefore the received data vector can be expressed as: 

y = U„S H Z2Z H S v Z£x + n = U H S„S v Z£x + n (41) 
Let rxl Y r = {U£y} r where {a}, means the first / elements of the vector 
a, then Y r is: 

Y r ={U2U H S„S v Z?x + U£n} r 

= {S H S„Z£x} r +{U» r =DZ?x + N r 
15 where the diagonalized channel, D, is a real rxr diagonal matrix equal to 
diag(SH,iSv,i,...,SH,rS\/,r) and N r has a covariance matrix equal to cr n 2 l r . Note 
that s H ,r*0 because the water-pouring strategy would never dictate sending 
power on a stream with a singular value of zero. 

For the Successive Cancellation MMSE weights, in order to maximize 
20 the capacity, r must equal the number of streams that the water-pouring 

strategy dictates and r must be less than or equal to N s . Assuming this is true 
(i.e., r<N s ), the Successive Cancellation MMSE weights that maximize 
capacity are found by solving: 



mm 
W, 



n £ 



_ min 
= W,T£? 

subject to : 



,?[ Yr - D |t p x p 



(43) 
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where the rxN s right singular matrix, T, is T = [t,,...,^ ] , U through t Ns are the 
right singular vectors, T = Z£ , and M R xN s W = [w 1 , w w ] 

It can be shown that the receive weight vector for stream I is given by: 

w, =[o|;t p t£D + <x n 2 l r j Dt, (44) 

Ignoring the constraint for now, the gradient of the objective function in 
(44) with respect to (the £ th column of T) can be shown to be: 

Vt, = w p WpDt f -Dw f (45) 

p=i 

Using (45), T can be found through a gradient search where the 
constraint of TT H =I is enforced at each step. This procedure is described in a 
flow chart representation 900 of a preferred embodiment of the invention 
illustrated in FIG. 8. 

The embodiment of the invention illustrated in FIG. 8 may be provided 
a channel matrix, H (910). Next the iteration number, m, can be set to 0 
(920). The right singular matrix at iteration number 0, T 0 , may next be 
initialized by choosing any arbitrary r*N s matrix such that T 0 T 0 H = I (930). 
Next, the receive weight vectors at iteration number m, w m , fI can be computed 
as a function of the diagonalized channel, D, and the right singular vectors at 

iteration number m, t m ,„ (940) as w mJ = ^Dgt m p t^ p D + ^lj Dt m/ for 

£=!,..., N s . Then the gradient vectors at iteration number m, g mj , may be 
computed as a function of the diagonalized channel, the receive weight 
vectors at iteration number m, and the right singular vectors at iteration 

number m (950) as g m , = D^w mp w^ p Dt m , -Dw ro( for €=1,...,/V S . If a step 

size, a, has not been calculated prior to (960) then a step size can be 
calculated that will minimize the objective function in (43) subject to T m T^ = I 
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(965). The next step may compute the right singular vectors at iteration 
number m+1 as a function of the right singular vectors at iteration number m, 
the step size at iteration number m, and the gradient vectors at iteration 
number m (970) as t m£ =X mA -aQ m , e for ^=1,...,A/ S . The iteration number may 
then be incremented by one (975) as m=m+1. Then (977) the right singular 
matrix at iteration number m, T m , is formed by concatenating the right singular 
vectors at iteration number m together (T m = [t m1 |,...,| t mW J ) and a Gram- 
Schmidt orthogonalization (as known in the art) can be performed on the rows 
of the right singular matrix at iteration number m (this enforces the constraint 
T m T„ = I ). The flow chart representation 900 continues as an embodiment of 
the invention in FIG. 8A. 

In FIG. 8A, if the iteration number has reached an 'end' value 980, 
where m en d may be an integer designating the maximum number of iterations, 
then the right singular matrix may be the right singular matrix at iteration 
number m, (T=T m ) and the receive weight vectors may be chosen to be the 
receive weight vectors at iteration number m, {w t =w m , e ) 990 completing this 
embodiment. If the iteration number has not reached an 'end' value, block 
985 can decide if trace(G^G m ) < s where G m =[g m1 Q m , N J, and if yes, the 
receive weight vectors are chosen to be the receive weight vectors at iteration 
number m, (w^=w m , f ) and the right singular matrix is chosen to be the right 
singular matrix at iteration number m, (T=T m ) 990 again completing this 
embodiment. For decision block 985, trace(G^G m ) means the sum of the 
diagonal elements of the square matrix (G^G m )and where s is a number 
indicating how small the gradient matrix at iteration number m should get. In 
other words, whentrace(G^G m ) < e , the algorithm has almost converged 
because at the optimal solution, G m =0. If in decision block 985, 
trace(G^G m ) < s , is false, the process recedes to block 940 of FIG. 8, and the 
successive cancellation weights method continues until all values are satisfied 
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by the techniques described within flow chart 900. After completion of the 
technique described within flow chart 900, the transmit weight vectors may be 
given as V = Z H S V T . 

The present invention may be embodied in other specific forms without 
5 departing from its spirit or essential characteristics. The described 

embodiments are to be considered in all respects only as illustrative and not 
restrictive. 
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